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Abstract. The goal of this paper is two-fold: 1. We review classical and recent measures of serial 
extremal dependence in a strictly stationary time series as well as their estimation. 2. We discuss 
recent concepts of heavy-tailed time series, including regular variation and max-stable processes. 

Serial extremal dependence is typically characterized by clusters of exceedances of high thresh- 
olds in the series. We start by discussing the notion of extremal index of a univariate sequence, i.e. 
the reciprocal of the expected cluster size, which has attracted major attention in the extremal 
value literature. Then we continue by introducing the extremogram which is an asymptotic auto- 
correlation function for sequences of extremal events in a time series. In this context, we discuss 
regular variation of a time series. This notion has been useful for describing serial extremal de- 
pendence and heavy tails in a strictly stationary sequence. We briefly discuss the tail process 
coined by Basrak and Segers to describe the dependence structure of regularly varying sequences 
in a probabilistic way. Max-stable processes with Frechet marginals are an important class of reg- 
ularly varying sequences. Recently, this class has attracted attention for modeling and statistical 
purposes. We apply the extremogram to max-stable processes. Finally, we discuss estimation of 
the extremogram both in the time and frequency domains. 



1. Introduction 

Measuring and estimating extremal dependence in a time series is a rather challenging problem. 
Since many real-life time series, especially those arising in finance and environmental applications, 
are non-Gaussian their dependence structure is not determined by their autocorrelation function. 
Correlations are moments of the observations and as such not well suited for describing the depen- 
dence of extremes which typically arise from the tails of the underlying distribution. 

1.1. The extremal index as reciprocal of the expected extremal cluster size. Extremal 
dependence in a real- valued strictly stationary sequence (Xt) can be described by the phenomenon 
of extremal clustering. Given some sufhciently high threshold u = Un, we would expect that ex- 
ceedances of this threshold should occur according to a homogeneous Poisson process, if (Xt) is iid. 
On the other hand, for dependent (Xt) exhibiting extremal dependence, exceedances of u should 
cluster in the sense that an exceedance of a high threshold is likely to be surrounded by neighboring 
observations that also exceed the threshold. Although the notion of extremal clustering is intuitively 
appealing, a precise formulation is not so easy. 

The intuition about extremal clusters in a time series can be made precise using point process 
theory. In the classical monograph by Leadbetter, Lindgren and Rootzen |40] the point process of 
exceedances of u was used to describe clusters of extremes as an asymptotic phenomenon when the 
threshold u„ converges to the right endpoint of the distribution F of X. (Here and in what follows, 
Y denotes a generic element of any strictly stationary sequence (Yt).) To be more precise, (m„) has 
to satisfy the condition n F{un) — n{\~ F{un)) t for some r S (0, oo). Under this condition and 
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mixing assumptions, the point processes of exceedances converge weakly to a compound Poisson 
process (see Hsing et al. [3T]): 

n oo 

(1-1) ^^£i/nl{x,>u,,} N = Xl^«^r, , 

i=l i=l 

where the state space of the point processes is (0,1], the points < Fi < r2 < • • • constitute 
a homogeneous Poisson process with intensity 9t on (0, 1] which is independent of an iid positive 
integer-valued sequence (^i). Here 6 G [0, 1] is the extremal index of the sequence (Xt). Thus, in an 
asymptotic way, a cluster of extremes is located at the Poisson points F; with corresponding size . 
The cluster size distribution = k), k > 1, contains plenty of information about the distribution 
of the extremal clusters. However, most attention has been given to determine the expected cluster 
size ES^ which can be interpreted as reciprocal of 6 as the following heuristic argument illustrates. 
Applying on the set (0, 1] and taking expectations on both sides of the limit relation, we 

observe that 

£;iV„(0, 1] = nF{un) -^T = EN {0,1] = E^E#{i > I : < 1} = E^ {6 t) , n oo. 

Thus 9 = 1/E^ with the convention that E£^ = oo for = 0. In the case of an iid sequence, = 1 
a.s., i.e. N collapses to a homogeneous Poisson process and 6 = 1. 
Writing M„ = max(Xi, . . . , we also observe that 

P(M„ < u„) = P(Ar„(0, 1] = 0) ^ P(7V(0, 1] = 0) = P(#{^ > 1 : F, < 1} = 0) = e 

while for an iid sequence (Xt) with the same marginal distribution F as for (Xt) and M„ = 
max(Xi , . . . , Xn) we have 

If F belongs to the maximum domain of attraction of an extreme value distribution H [F € 
MDA{H)) there exist constants c„ > 0, d„ € M, n > 1, such that P(c~^(M„ - d„) < x) H{x) 
for every x € supp(7?) (the support of H); cf. Embrechts ct al. [20], Chapter 3. Thus, writing 
Unix) = CnX + dn and T = t{x) = — logH{x) for X G supp(iJ), the existence of an extremal value 
index 9 of the sequence (Xt) implies that 

(1.2) P{c~\Mn-dn)<x)^ H^ix), xeR. 

The concrete form of the extremal index is known for various standard time series models, in- 
cluding linear processes with iid subexponential noise (cf. [20], Section 5.5), Markov processes 
(see Leadbetter and Rootzen [IT], Perfekt [S3]) and financial time series models such as GARCH 
(generalized autoregrcssive conditionally heteroscedastic) and SV (stochastic volatility) models; cf. 
[Tll|T3l[T4]. Expressions of the extremal index for regularly varying sequences (Xt) (see Section [1.2.2l 
for a definition) in terms of the points of the limiting point process were given in Davis and Hsing 
[5] and in terms of the limiting tail process in Basrak and Segers [2]; see (jl.lOp below. However, for 
most models these concrete expressions of 6 are too complex to be useful in practice. 

An exception are Gaussian stationary sequences (Xt). Writing jx{h) = cov{Xo, Xh), h > 0, 
for the covariance function of (Xt), this sequence has extremal index 6 = 1 under the very weak 
condition jxih) = o(l/log/i) as /i — >■ oo (so-called Herman's condition); see Leadbetter et al. [40j . 
cf. Theorem 4.4.8 in Embrechts et al. [20]. Notice that Herman's condition is satisfied for fractional 
Gaussian noise and fractional Gaussian ARIMA processes (see Chapter 7 in Samorodnitsky and 
Taqqu [M], and Section 13.2 in Brockwell and Davis [S]). Subclasses of the latter processes exhibit 
long range dependence in the sense that J2h \'yx{h)\ = ooQ 



This remark also indicates that long range dependence for extremes should not be defined via the covariance 
function ■yx- As explained above (see l|1.2|l ') the existence of a positive extremal index 9 ensures that the type of 
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We conclude that any Gaussian stationary sequences which are relevant for applications do not 
exhibit extremal clustering in the sense that 6 = 1. If 6' = 1 one often says that {X-t) exhibits 
asymptotic independence of its extremes. However, the notion of asymptotic independence is not 
well defined and may have rather different meanings in the extreme value context, as we will observe 
later. 

Due to the complexity of expressions for the extremal index it has been recognized early on 
that 9 needs to be estimated from real-life or simulated data. Various estimators were proposed 
in the literature. Among them, the blocks and runs estimators are the most popular ones. These 
estimators are non-parametric estimators of 9 which, in different ways, define and count clusters in 
the sample and use this information to build estimators of 9 under mixing conditions. In addition 
to the delicate choice of a threshold u„ , these estimation techniques also involve the construction of 
blocks of constant (but increasing with the sample size n) length or of flexible length depending on 
the local extremal behavior. These estimators often exhibit a rather large uncertainty. 

In Figures [TTTj and fTT^ we illustrate the estimation of 9 for real and simulated data. We choose the 
simple blocks estimator 9 = K^/Nn of the extremal index 9, where Nn is the number of exceedances 
of the threshold u = u„ in the sample Xi, . . . , X„ and Kn is the number of blocks of size s = s„, 
. . . ,Xis, i = 1, . . . , [n/s], with at least one exceedance of u. 

Aspects of bias, variance and optimal choice of blocks for the estimation of 9 were discussed 
in Smith and Weissman [65]. In a series of papers, Hsing [27l EH [29l [30] studied the extremes of 
stationary sequences, including the asymptotic behavior of their extremal index estimators. The 
recent papers Robert [SHI [SO], Robert et al. [SS], in particular (50], give historical accounts of 
estimation of 9 and some new technology for the estimation of 9 and the cluster size distribution 
= k), k > 1. The paper of Robert [60] is devoted to inference on the cluster size distribution. 
The literature on this topic is sparse; Robert [6^ mentions Hsing [22 as a historical reference. 

1.2. The extremogram: an asymptotic correlogram for extreme events. Davis and Mikosch 
[131 117j introduced another tool for measuring the extremal dependence in a strictly stationary M'^- 
valued time series (Xt): the extremogram defined as a limiting sequence given by 

(1.3) 7AB(/i) J™^"cov(/{^-i^^g^j,/^^-i^^g5j), h>Q. 

Here (a„) is a suitably chosen normalization sequence and A, B arc two fixed sets bounded away from 
zero. The events {Xq € OnA} and {X^ € On-B} are considered as extreme ones and jAsih) measures 
the influence of the time zero extremal event {Xq £ a„ A} on the extremal event {X^ G a„ B}, h 
lags apart. The choice of (a„) depends on the situation at hand. To avoid ambiguity, we later 
assume that (a„) satisfies the relation nP{\X\ > a„) ~ 1. With this choice of (a„), jAsih) = 
lim„_yoo n P(a~^Xo € A,a~^Xh G B). Motivating examples of extremograms are the limiting 
conditional probabilities lim,i_).oo Pi^-n^Xh G B \ a~^Xo G A) in Davis and Mikosch [T3l ITT]. 



the limiting extreme value distribution H remains the same as in the iid case. This is easily checked since the only 
possible non-degenerate limit distributions H are the types of the Frechet distribution ^a{x) = , x,a > 0, 

the Weibull distribution >l'a(x) = e~(~^' , a; < 0, a > 0, and the Gumbel distribution A{x) = e~° , x G R. This 
is a consequence of the Fisher- Tippett theorem; cf. Embreehts et al. |20| . Theorem 3.2.3. The notion of long range 
dependence in an extreme value sense would be reasonable if in 1 11.211 a limit distribution occurred which does not 
belong to the type of any of the three mentioned standard extreme value distributions. This, however, can only be 
expected if a given stationary sequence (Xt) with F S MDA(_ff) does not have an extremal index or if 6 = 0. Examples 
of sequences with zero extremal index are given in Leadbetter et al. | 40| and Leadbetter 1391 , but such examples 
are often considered pathological; see also the discussion in Samorodnitsky |63) who studied infinite variance stable 
stationary sequences with zero extremal index and the boundary between short and long range extremal dependence 
for these sequence. 
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Figure 1.1. Blocks estimator of the extremal index 9 for 31, 757 5-minutc log-returns of Bank of 
America stock prices. The blocks estimator as a function of the block size s and the u% upper order 
statistics (top), for fixed u = 1.9% and running s (bottom left) and for fixed s = 10 and running u 
(bottom right). 



A motivating example for d = 1 with A = B ^ (l,oo) is the so-called (upper) tail dependence 
coefficient of the vector (ATq, Xh) given as the limit 

(1.4) p{h) = lim P{Xh >x\Xo>x). 

a;— >oo 

(Here we assume that X has infinite right endpoint.) These pairwise tail dependence coefficients have 
attracted some attention in the literature on quantitative risk management; see for example McNeil 
et al. [42]. Notice that p{h) coincides with 7aa(/i) if we choose (a„) such that nP(Xo > a„) ^ 1 as 
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Figure 1.2. Blocks estimator 9 of tlie extremal index 6 for a sample of size 20 000 from the AR(1) 
process Xt = 0.8Xf_i + Zt- The iid noise (Zt) has a common student distribution with a = 2 
degrees of freedom. The extremal index 6 = 0.37 is known (indicated by dashed line); see pO] . 
Section 8.1. The blocks estimator as a function of the block size s and u% of the upper order 
statistics (top), for fixed u = 2% and running s (bottom left) and for fixed s = 24 and running u 
(bottom right). 



oo. Indeed, 

P{Xh > an,Xo > a„) - (P(Xo > an)f 



nCOv{I{Xo>a„},I{X^>a„]) 



P{Xo > an) 
P{Xh > an \Xo> an). 
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A similar calculation for any dimension d and suitable sets A, B shows that the limiting sequence 

(1.5) (^^^s ^^-;;;h, /.>o, 

inherits the properties of a matrix covariance function. Notice that the entries of these matrices 
cannot be negative. The interpretation of (|1.5p as covariance function allows one to use the classical 
notions of time series analysis in an asymptotic sense. For example, notions such as long or short 
range dependence of extremal events can be made precise by specifying the rate of decay of (jl.Sp as 
/i — )• oo. Davis and Mikosch ^2], Mikosch and Zhao [FT] introduced an analog of the spectral density 
as a Fourier transform of the sequence (|1.5|) . They showed that the periodogram of the sequence 
of indicators I^a^^ Xt^A} '^^ extremal events {a~^Xt e A], t ^ 1,, has properties similar to the 
classical periodogram of a stationary sequence. In particular, weighted averages of the periodogram 
are consistent estimators of the spectral density. 

In the literature, the pairwise tail dependence coefficients (|1.4p are mostly considered for con- 
crete examples of distributions, such as elliptical ones, including the multivariate t- and Gaussian 
distributions; see e.g. McNeil et al. [42]. In these cases, one can verify that the limits in (|1.4|) 
exist. For a Gaussian stationary sequence, p{h) = 0, h> 1, unless Xt = X a.s. for all t e Z. The 
case p{h) = for some /i > 1 is (again) referred to as asymptotic extremal dependence in the vector 
{X[j,Xh) although no extremal index is in view. 

In general, it is not obvious whether the limits p{h) and. more generally, ^AB{h) for h> exist. 
In this paper, we will use the notion of a regularly varying stationary sequence. It is a sufficient 
condition for the existence of the limits jAB{h). Roughly speaking, a regularly varying sequence 
of random variables (Xt) has power law tails for every lagged vector {Xi, . . . , Xh), h > 1. In what 
follows, we make precise what regular variation means. 

1.2.1. Regularly varying random vectors. The notion of regular variation is basic in extreme value 
theory and limit theory for partial sums of iid random variables. In multivariate extreme value 
theory, regular variation with index a > of the d-dimensional iid random vectors Xt, t S R, with 
values in (0, oo)'' is necessary and sufficient for the fact that the normalized sequence of component- 
wise maxima (a~^ max(<„ Xt*^)i=i_..._d, t = 1,2,..., converges in distribution to a d-dimensional 
extreme value distribution H on (0,oo)'' whose marginal distributions are Frechet ^a-distributed; 
see Resnick |56j for a general theory of multivariate extremes for iid sequences. Similarly, for a 
general M'^-valued iid sequence {Xt), the sequence of suitably normalized and centered partial sums 
a~^{Xi -|- • • • -|- Xn — bn) converges in distribution to an infinite variance a-stable limit if and only 
if the distribution of X is regularly varying with index a. The index a is then necessarily in the 
range a £ (0, 2). We refer to Rvaceva [62] and Resnick [57] for proofs of this fact. 

Various definitions of a d-dimensional regularly varying vector X exist; we refer to Resnick 
[55l|56l|57]. We start with a definition in terms of spherical coordinates. We say that X is regularly 
varying with index a > and spectral measure P{Q G ■) on the Borel cr- field of the unit sphere 
gd-i ::^{xeR'^: \x\ = 1} if0 the following weak limits exist for every fixed t > 0: 

Relation (|1.6p can be written in an equivalent form as a pair of conditions: 



The choice of the norm | ■ | is relevant for defining the corresponding unit sphere and the spectral measure on it, 
but the notion of regular variation of a vector does not depend on a particular choice of norm. In this paper, | ■ | will 
stand for the Euclidean norm. 
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(1) The norm \X\ is regularly varying in the classical sense, i.e. > tx)/P{\X\ > x) — > t^", 
t > 0, or, equivalently, P(|X| > x) = x^°'L{x), a; > 0, for a slowly varying function L; cf. 
Bingham et al. [21 for an encyclopedia on regularly varying functions. 

(2) The angular component is independent of \X\ for large values of \X\ in the sense that 

(1.7) P{x/\x\e-\\x\>x)^P{ee-), x^oo. 

In any of these limit relations, it is possible to replace the converging parameter a; by a sequence 
(a„) such that P(|X| > a„) ^ n~^. Then (|1.6p and (|1.7p . respectively, read as 

nP{\X\> tan,X/\X\e ■) ^t-"P{e e ■) and P(X/|X| e • | |X| > a„) 4 P(e £ •) • 

The convergence relation ()1.6|) can be understood as convergence on the particular Borel sets {x G 
M.'^ : |x| > x/|x| g S} for Borel sets S C S''"^ with a smooth boundary. This convergence can be 

extended to the Borel a-field on Iq = 1'' \ {0}, 1 = R U {oo, -oo}: 

(1.8) ^(1^1^^) -^M-), -^oo. 

Here A refers to vague convergence of measures on the Borel cr-field on Mq, i.e. J-^d f d^x ./ 

as X ^ oo for any continuous and compactly supported / on Mg; see Kallenberg |36] . Resnick |56j . 
This means in particular, that the support of / is bounded away from zero. In view of (jl.6p . 

/^({x e Ig : |x| > i,x/|x| e 5"}) = t-"P(e g S), and therefore /z is a Radon measure (i.e. finite 
on sets bounded away from zero) satisfying iJ.{tA) = t~°'fi{A), t > 0. In particular, fi does not 
charge points containing infinite components. Again, the parameter x in (|1.8p can be replaced by a 
sequence (a„) satisfying P(|X| > a„) ^ n^^ and then we get 

riP(a,7^X G •) A /i(-) , n ^ oo . 

For an iid sequence {X-t ) with generic element X , the latter condition is equivalent to the convergence 
of the point processes 



where is a Poisson random measure with mean measure n and state space Mq; see Resnick 
[55l [56] . Since point process convergence is basic to extreme value theory, the notion of multivariate 
regular variation is very natural in the context of extreme value theory for multivariate observations 
with heavy-tailed components; see also the recent monograph by Resnick [57] who stresses the 
importance of the notion of regular variation as relevant for many applications in finance, insurance 
and telecommunications]^ 

1.2.2. Regularly varying stationary sequences. A strictly stationary sequence (Xt) is regularly vary- 
ing with index a if its finite-dimensional distributions are regularly varying with index a, i.e. for 
every h > 1, there exist non-null Radon measures /i/,. on the Borel ct- field of Mq and a sequence (a„) 
such that On — > oo and 

(1.9) nP{a-\Xi,...,Xh)e-)^ ^ih{■), n^oo. 



■^The notion of regular variation is essentially dimensionless; see for example relation 1 11. 61 1 which immediately 
extends to normed spaces and, more generally, to metric spaces. An account of the corresponding theory can be 
found in Hult and Lindskog |32j . Applications of regular variation in function spaces to extreme value theory can be 
found in de Haan and Tao 1261 . Davis and Mikosch 1121 . Meinguet and Segers I43| . to large deviations in Hult et al. 
[33] . Mikosch and Wintenberger [49l[50], and to random sets in Mikosch et al. [iillis] . 
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Here and in what follows, we will choose the normalizing sequence (a„) such that -P(|X| > a„) ~ 
n~^, where we use the notation (a„) in a way different from Section 11.2.11 Indeed, in the latter 
section we defined the normalization (ai''-') such that P{\{Xi, . . . ,Xh)\ > al'''') ~ n~^, but then the 
normalization would depend on the dimension h. This is not desirable. However, notice that 

71^00 Pi\X\ > a„) 

Therefore, by the properties of regularly varying functions, there exist positive constants c^^" = 
lim„_^oo o-n/oLn^ , h > 1. Hcncc for sets A bounded away from zero such that fih{dA) = we have 
nP{{al^'>y\X,, ...,Xh)eA) = nP(a-i(a„/aW)(Xi, . . . £ A) 

^ Ch[nP{a~\Xu...,Xh)eA)] 

ChlJLh{A), 

i.e. the limit measures of regular variation under the different normalizations only differ by some 
positive constants. 

The condition of regular variation on the sequence {Xt) seems to be a severe restriction since 
the tails of the marginals are power laws. However, following Resnick |56] . Proposition 5.10, any 
multivariate distribution (with continuous marginals) in the maximum domain of attraction (MDA) 
of a d-dimensional extreme value distribution can be transformed to a distribution G with common 
Frechet or Pareto marginals. Then G is in the MDA of an extreme value distribution with Frechet 
marginals or, equivalently, G is regularly varying. 

For example, transforming the marginals of a Gaussian stationary sequence to unit Frechet, the 
resulting sequence is regularly varying with index a = 1. We mentioned before that the tail depen- 
dence coefficient p{h) = 0, ft. > 1, for any non-trivial Gaussian stationary sequence. The quantities 
p{h) remain invariant under monotone increasing transformations of the marginals. Hence, the 
transformed Gaussian distribution with unit Frechet marginals exhibits asymptotic independence 
in the sense that the limit measures ph are concentrated on the axes. 

The tail process. An insightful characterization of an ]R''-valued regularly varying stationary se- 
quence (Xt) was given in Theorem 2.1 of Basrak and Segers [2]: there exists a sequence of M'^-valued 
random vectors {Yt)tez such that P(|l^o| > v) = 2/~" for y > 1 and for any h > 0, 

P{x-\X^h,.-.,Xh) e • I \Xo\ > x) ^ P{{Y^h, ■ . ■ ,Yh) e a:^oo. 
The process (Yf) is the tail process of {Xt). Writing 8t — Yt/\YQ\ for t E Z, one also has for h > 0, 

P{\Xo\-\X^h,.-.,Xh) G • I \Xo\>x)^P{{e^h,-.-,eh) e •), a^^oo. 

The process (Qt) is independent of jYol a-nd called the spectral tail process of (Xt). Notice that 
P(9o e •) is the spectral measure of X. 

Basrak and Segers [2] also gave an expression for the extremal index in terms of the spectral tail 
process: 

(1.10) 6i = £;[sup|et|"-sup|et|"] . 

t>0 t>l 

1.2.3. The extremogram revisited. Now consider an R'^-valued regularly varying stationary {Xi). 
Then the extremogram ^AB{h)^ ft > 0, is well defined. Indeed, for every ft > 0, the vector 
{Xi, . . . ,X/i+i) is regularly varying with limit measure pu+i- Then, with normalization (a„) such 
that P{\X\ > a„) - 

nP{a-^Xh e B,a-^Xo E A) ^ Ph+M x M'^C'-i) xB)=-fAB{h), ft > 0, 
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provided A -B is a continuity set with respect to the measure jih+i- Similarly, for d = \ 

and A = B = {1, oo), 

' tih+i{AxW^) 

These limits can also be expressed in terms of the tail process. In the former case, assuming that 
A is bounded away from zero, there exists i5 > such that A d {x ^W'- : \x\ > 5}. Hence 

P{a-^Xh e B, g-^Xp £ A) 
P{\X\ > a„) 

Pjq-^Xh £ B,a-^Xo £ A, \Xo\ > San) 
P{\X\>an) 

P{{6an)-^Xh £ 6-^B, i6an)-^Xo £ 5-^ A, \Xo\ > Sa^) P{\X\ > fe„) 
P{\X\ > San) P{\X\ > an) 

^ P{{Yo,n)eS-\AxB))5~'^ 
= P{{Yo,n) e A X B) = jAB{h) . 
Similarly, for d = 1 and A = B = (1,c>d), assuming that liuix^oa P{X > x)/P{\X\ > x) ^ 

E{eo)% = p(eo = 1) > 0, 

P{Xh > an,Xo > an) ^ PjXh > an,Xo > a„, \Xo\ > an) P{\X\ > an) 
P{X > an) ^ P{\X\ > an) P{X > an) 

^ PiYh>l\Yo>l) 

p(|ro|min(eo,e,o > i) 



PdFoieo > 1) 

= P{h) 



E{mm{&o,Qh))1 



E{eo)% 

1.2 A. Examples of regularly varying sequences and their extremograms. In this section, we will intro- 
duce some important classes of real- valued strictly stationary regularly varying stationary sequences 
with index a > 0. We will also give the values of the extremogram p{h), ft, > 1, in (|1.4p . For the 
calculation of p in these examples, we refer to [T71 [5Tj . 

IID sequence. An iid sequence (Zt) is regularly varying with index a if and only if Z is regularly 
varying with the same index; the limit measures ph are concentrated on the axes and p{h) = 0, 
h>l. 

Linear process. Historically, the class of linear processes with regularly varying iid real- valued noise 
(Zf) has attracted attention in extreme value theory and in time scries analysis. A (causal) linear 
process 



oo 



(1.11) Xt = }_^^p,Zt-j, tez, 

t=0 

inherits regular variation under conditions on the deterministic sequence ("0^) which are close to 
those dictated by the 3-series theorem, ensuring the a.s. convergence of the series in (jl.lip . This 
fact was proved in Mikosch and Samorodnitsky }47j for the distribution of X. The regular variation 
of the finite-dimensional distributions of (Xt) follows since regular variation is preserved under affine 
transformations of regularly varying vectors. The class (jl.lip includes causal ARMA processes which 
are relevant for applications. We refer to Chapter 7 of Embrcchts et al. [50] for various applications 
of regularly varying linear processes. 
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Under the tail balance condition P{Z > x) ^ pP{\Z\ > x), P{Z < -x) ~ qP{\Z\ > x), as 
a; — )■ oo, for some p, g > with p + q = 1, 



p{h) 



2^i=0 


p (niin(V'+, V-i^/J)" + q (min(V'i , i}i+h)T 









h> I. 



Stochastic recurrence equations. Next to linear processes, solutions to the stochastic recurrence 
equation 

(1.12) Xt= AtXt-i+Bt, tel., 

have attracted some attention. Here {At,Bt), t G Z, is an iid R^-valued sequence. An a.s unique 
causal solution to (|1.12p exists under the moment conditions ElogA~^ < and i?log|i?| < oo. It 
follows from work by Kesten |37] and Goldie [3 7) that X is regularly varying in the precise sense 
that 

P{X > x) ^ and P{X < -x) - c_a;"" , a; ^ oo , 

for constants c+ , c_ > such that c-|_ + c_ > provided the equation 

(1.13) E\A\'' = 1 

has a positive solution a (which is unique due to convexity), EB°' < oo and further regularity 
conditions on the distribution of A are satisfied. Iteration of (|1.12p shows that the finite-dimensional 
distributions of (Xt) are regularly varying with index a. This fact is rather surprising since the 
distributions of A and B do not need to be heavy-tailed, in contrast to linear processes , where the 
noise (Zt) itself has to be heavy-tailed to ensure regular variation of (Xt). We mention that the 
case of multivariate B and matrix- valued A has also been studied, starting with Kesten |37j : see the 
recent paper Buraczewski et al. [7]. 

Assuming A > a.s., similar calculations as in the proof of Lemma 2.1 in [16] yield 

(1.14) p(/i) = £:[min(l,yli---yl,,)"], h>l. 

Models for returns. Log-returns Xt = log Pt — logPt_i, < £ Z, of a speculative price series (Pt) are 
often modeled of the form Xt ~ o^tZt, where (at) is a strictly stationary sequence of non- negative 
volatilities and (Zt) is an iid multiplicative noise sequence. The feedback between (at) and (Zt) can 
be modeled in a rather flexible way. 

Stochastic volatility models. The most simple approach is to assume that (at) and (Zt) be inde- 
pendent. The resulting time series model is frequently referred to as stochastic volatility model. 
Its probabilistic properties are rather simple; see Davis and Mikosch |15j . In particular, regular 
variation of (Xt) results if Ea"'^^ < oo for some S > and (Zt) is iid and regularly varying with 
index a. The corresponding limit measures fih in (|1.9p are then concentrated on the axes; see Davis 
and Mikosch [TT1[T3]Q and then also p(/i) = 0, /i > 1, as in the iid case. The situation changes if 
E\Z\°'~^^ < 00 for some (5 > and (at) is regularly varying with index a. Then (Xt) is regularly 
varying with index a and extremal clustering for this sequence is possible; see Mikosch and Rezapur 

Eel. 



^The fact that /iji, /i > 1, is concentrated on the axes is also referred to as asymptotic extremal independence. Recall 
that, in an extreme value context, various other situations are also referred to as asymptotic extremal independence, 
among them the cases of unit extremal index and zero tail dependence coefficient. Asymptotic independence in the 
sense of the limiting measures fi^ is much more complex than the other notions which are just numerical characteristics. 
The fact that /i/i is concentrated on the axes means that it is very unlikely that any two values Xt and Xs, s t, 
are big at the same time, just as for independent random variables. On the other hand, this kind of asymptotic 
independence heavily relies on the notion of multivariate regular variation. 
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GARCH model. Among the models for returns Xt = atZt, t ^ "L, the GARCH family gained most 
popularity. The simplest model of its kind (ARCH) was introduced by Engle [21] and the more 
sophisticated GARCH model by BoUerslcv For simplicity, we consider the GARCH (1,1) case 
given by cr^ = q;q + a^_l{alZ^_^ + t £ "L, where Q!o,ai,/3i are positive constants with certain 
restrictions on the values of ai + /3i, /3i < 1, to ensure strict stationarity. Typical choices are 
standard normal or unit variance t-distributed Z. Notice that we can write — At<Jt_i + Bt, 
where Bt = ag and At = aiZf_i + /3i, t € Z. Therefore regular variation of (at) follows from 
the corresponding theory for stochastic recurrence equations, and (Xt) inherits the same property; 
see Davis and Mikosch [10] for the ARCH(l) case, Mikosch and Starica [48] for the GARCH(1, 1), 
Basrak et al. [Tj for GARCH(p, g) and the review paper Davis and Mikosch [T^. Real- life log- 
returns are typically heavy-tailed. The GARCH model captures this property and this was one of 
the reasons that it became a benchmark model in financial time series analysis from which numerous 
other models where derived. An expression of p for (X^) is given by (|1.14p with At = aiZ^ + (3i. 

Infinite variance stable sequence. Stable processes with infinite variance have become popular due 
to their attractive theoretical and modeling properties; see Samorodnitsky and Taqqu |64j . The 
finite-dimensional distributions of an a-stable process are jointly a-stable, hence they are regularly 
varying with index a G (0,2). The class of infinite variance stationary stable processes has been 
intensively studied; see Rosihski [61]. An expression of p is given in [16], Section 2.4. 

Max-stable processes. This class of processes has recently attracted some attention since it is a 
flexible class for modeling heavy tails and spatio-temporal dependence. Since the finite-dimensional 
distributions of max-stable processes are explicitly given it is often simple to verify properties (such 
as regular variation) and to calculate certain quantities (e.g. mixing coefficients, extremal index). 
We will use this class of regularly varying processes to illustrate the general theory. 

Following de Haan [53], a real-valued process {£,t)t£T, T C M, is a-max-stable for some a > if 
its finite-dimensional distributions satisfy the relation 

PiCti < xi,. . . ,^ta < Xn) = exp|-/ max(— )"rt^(ds)| , 

e r,i = i,2,...,d, X, >o, d>i, 

where Ft^ arc finite measures on the unit sphere. This means in particular that the marginal 
distributions of the process ^ have a Frechet distribution with parameter a given bj0 

(1.15) $„(a;) = e-^"° , a; > . 

Dc Haan |23| also introduced the notion of a-max-stable integral. Given a cr-finitc measure space 
{E,E,v), consider a Poisson random measure 'Yl7Li^(Ti,Yi) with < Fi < r2 < • • • on the state 
space M+ x E with mean measure LEB x v. For / > with / G L°'{E,£, v) the max-stable integral 
is defined as 

r fdM^ =svipT-'^^ f{Y,) . 

J E i>l 



The choice of Frechet <I>c< marginals is for convenience only; then results on regular variation are applicable. 
Since Gumbel or WeibuU distributed random variables can be obtained by suitable increasing transformations of a 
Frechet random variable any result for max-stable processes with Frechet marginals can be formulated in terms of 
the transformed processes with Gumbel or WeibuU marginals; see for example Kabluchko et. al [35) who formulated 
their results in terms of Gumbel distributions. Since the choice of the parameter a is also arbitrary in this context, 
most results in the literature are formulated for processes with unit Frechet $1 marginals. 
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Using the order statistics property of the homogeneous Poisson process with points (Fi), one obtains 

/•V /• 

(1.16) P( / f dM^ < x) ^ cxp [- X-"' riy)i^idy)}, x>0. 

Moreover, for any non-negative fi £ L"-{E, £, z^), i = 1, . . . , d, by (|1.16p . 



V 



P[ / f^dM^ <x.,,i = l,...,d) = P{ I max — dM^ < I 



E ^ ^ J E 

(1.17) = expj - /" max (^^^)"i/(dy)} , x^ > ,i ^ 1, . . . ,d . 

The notions of max-stable process and integral bear some resemblance with the corresponding a- 
stable ones; see Stoev and Taqqu [67], Kabluchko [34] . 

We will focus on stationary ergodic max-stable processes with integral representation 

(1.18) Xt^ f ftdM^, teR, /t>0, fteL''iE,£,iy). 

J E 

As in the case of a-stable stationary ergodic processes (Rosihski [HI]), the choice of {ft) is rather 
sophisticated; see Stoev [66], Kabluchko [34] for details. De Haan [23] showed that any max- 
stable process with countable index set T C M and stochastically continuous sample paths has 
representation (|1.18p and Kabluchko [33] proved this fact for any max-stable process on T for 
sufficiently rich measure spaces {E, £, v). In what follows, we will always assume that the considered 
max-stable processes have representation (|1.18p . 

Next we give some basic properties of a stationary max-stable process. 

Proposition 1.3. The following statements hold for the skeleton process {Xt)tez of the process 
(HHl). 

(1) The finite- dimensional distributions of (Xt) are regularly varying with index a and the limit 
measures of the finite- dimensional distributions are given by its values on the comple- 
ments of the rectangles (0, x] = {y G R'' : < < , i = 1, . . . , /i}, /i > 1, x = (xi, . . . , x^) 
with Xi > 0, i — 1, . . . ,h: 



(1.19) M/>((0,: 



lEfoiy)Hdy) 

(2) The sequence (Xt) has extremal index 6 if and only if the limit 



(1.20) 61 = lim - 



1 maxt=i,._„ /i"(y) v{dy) 



n^oon jEfSiy)^i'^y) 



exists. 

(3) The extremogram for the sets A = (a, oo) and B = (&, oo), a, 6 > 0, is given by 

n^n . (h^ i'EfSiy)^{my)T-idy) 



and, for a = b = 1, 



lEfoiy)^fhiy)'^idy) 



(1.22) p(fe)^ -^^-y- ^ , h>i. 

JEfoiyi'^idy) 
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(4) Let Si,S2 be finite disjoint subsets of Z and a{C) the a-field generated by {Xt)ti=c for any 
C (zTi. Recall the a-mixing coefficient relative to the sets Si,S2- 

a{Si,S2)= sup \P{Ar\B)- P{A)P{B)\. 

Aea{Si),Becr{S2) 

and for = {..., —1, 0}, S}^ = {h, h + 1, . . .}, h > 1, introduce the mixing rate function 
Then there exists a universal constant c > such that 

oo „ 

(1.23) an<c E / /o"(2/) A (y) ^^2/) , > 1 ■ 

Remark 1.4. If the limit in (|1.20p exists it belongs to the interval [0, 1]. Indeed, by stationarity of 
/ niax /f(y)Kdy)<E / /^(y)^^?/)^" / foiy)'^{dy). 

J^t=l,...,n 7^1 J E Je 

Remark 1.5. Part (4) is a consequence of Corollary 2.2 in Dombry and Eyi-Minko [19] proved 
for /3-mixing. If foiv) A f^{y) i^{dy) < cq e "'^^ , /i > 1, for some constants co,ci > 0, then we 
conclude that ah < Ce~'^^ for some C > 0. 

Proof. Part (1) Since the integrals fidAI", i = 1, . . . ,h, are supported on (0, oo) it suffices to 
show that there exists a non-null Radon measure ii^ on the Borel cr- field of Rq n (0, oo]'' such that 

(1.24) nP{a-\X,,...,Xh)) G [0, x]^) ^ a*/. ([0, x]^) , 
where x is chosen such that [Ojx]"^ is a /x/j-continuity set and 

PiX > a„) - 1 - exp{ - a;^" / f^{x)iy{dx)} ^ , 

JE 

see Resnick [57] . Theorem 6.1. A Taylor expansion argument shows that wc can always choose 

an / /o"(x)Krf^))'/". 

Je 

An application of (|1.17p and a Taylor expansion yield (|1.24p with limit as specified in ()1.19p . 
Part (2) Applying (|1.17p for Xi = x > 0, we obtain 

Pia-'M„ <x) = cxp f - a-'^x-" f max (y)z/(dy)| . 

By definition of the extremal index, the right-hand side must converge to ^^ai^) for some 6 e [0, 1]. 
Equivalently, the limit in ()1.20p exists. 

Part (3) As regards the extremogram for sets A = (a, oo), B = {b, oo), a,b > 0, wc have the relation 
P{Xh >bx,Xo>ax) = P{Xh >bx) + P{Xo >ax)- P{{Xhlb) V (Xo/a) > x) 

= 1 - cxp { - X-- (Mm))" _ exp { - X-- {IMy ,i^dy)} 

+ exp{-x-"/^(M^)"vM^)%(rfy)}. 

In view of stationarity, fhiv) ^i'^v) ~ Je foiu) ^{'^v)- Using a Taylor expansion as a; — ?> oo, we 
obtain the desired formulas (|1.2ip and (|1.22p 
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Part (4) Wc obtain from Corollary 2.2 in Dombry and Eyi-Minko [T^ for any disjoint closed 
countable subsets 5*1 , 52 of M 

Then (|1.23p is immediate. □ 

Next we consider two popular models of max-stable processes. 
Example 1.6. The Brown-Resnick process (see [B]) has representation 

(1.25) Xt = suprri/"e^'(*)-°-5'^'(*), teM, 

i>l 

where (r^) is an enumeration of the points of a unit rate homogeneous Poisson process on (0,oo) 
independent of the iid sequence (Wi) of sample continuous mean zero Gaussian processes on M 
with stationary increments and variance function a^. The max-stable process (|1.25p is stationary 
(Theorem 2 in Kabluchko et al. [35]) and its distribution only depends on the variogram V{h) = 
vaT{W{t + h) - W{t)), t eR,h> 0. It follows from Example 2.1 in Dombry and Eyi-Minko [H] 
that the functions {ft) in representation (|1.18p satisfy the condition 

/ foiy) A f;:iyMdy) < cmbVnh)) , 

Je 

where $ is the standard normal distribution. For example, if W is standard Brownian motion, 
V{h) = h, $(0.5 ^ ce "''/^ft."" '^; as /i ^ oo. An application of Remark 11.51 shows that (ah) 
decays at an exponential rate. 

Recently, the Brown-Resnick process has attracted some attention for modeling spatio-temporal 
extremes; see [Ml [351 IBHl [S2] ■ The processes (|1.25p can be extended to random fields on M''. These 
fields found various applications for modeling spatio-temporal extremal effects; see Kabluchko et al. 
[35] . For further spatio-temporal applications of max-stable random fields, see also Davis et al. [9]. 

Example 1.7. We consider de Haan and Pcreira's |25] max-moving process 

(1.26) Xt = sup r7i/"/(t -[/,), teR, 

i>l 

where / is a continuous Lebesgue density on M such that snp^^^i f{x+h) dx < oo and X^i^i ^{ri,Ui) 
are the points of a unit rate homogeneous Poisson random measure on (0, oo) x R. 

The resulting process (Xt) is a-max-stable and stationary. According to Example 2.2 in Dombry 
and Eyi-Minko [T9], 

/ foiy) A fhiyMdy) < c [ mm{f{-x)J{h ~x))dx, h>0. 

J E Jr 

For example, if / is the standard normal density, this implies that (ah) decays to zero faster than 
exponentially, i.e. the memory in this sequence is very short. In Figure [1.2.41 a simulation of the 
corresponding process (|1.26p for a = 5 is shown. 

2. Estimation of the extremogram 

2.1. Asymptotic theory. Natural estimators of the extremogram are obtained by replacing the 
probabilities in the limit relations (|1.3|) and (|1.4|) with their empirical counterparts. In this context, 
one works with quantities which are derived from the tail empirical process; see the monographs de 
Haan and Ferreira [24], Resnick [57] for the underlying theory. For the introduction of the sample 
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Figure 1.8. Max-stable process (|1.26p where / is tlie standard normal density and a ~ 5. Extremal 
clusters are clearly visible. 



extremogram, consider an M''- valued strictly stationary regularly varying process (Xt) and a Borel 

set C C Mg bounded away from zero. Then, for any sequence m = m„ — > oo with m„/n — > as 
n — oo, we define the following estimator of Pm{C) = m P{a~^ X £ C): 

n 
t=l 

A possible choice of (a™) is given by -P(|X| > ~ m~^. By definition of regular variation of X, 
for any /xi-continuity set C, 

Here the condition to„ — ^ oo as n — >■ oo was crucial for asymptotic unbiasedness. For the calculation 
of the asymptotic variance of Pjn{C) we assume the following condition: 

(M) The sequence {Xt) is a- mixing with rate function (a^) and there exists a sequence r„ — > oo 
such that Tn/mn — )■ as n — > oo. 



(2.1) lim TO„ a/i = 

n— >oo ^ — ^ 

h—Tji 

and for every e > 0, 

(2.2) lim limsupm„ ^ P{\Xh\ > tam, \Xo\ > eam) = . 



h=k 



This condition is technical: (|2.ip imposes some rate on the mixing function (a/i) and (j2.2l) avoids 
"extremal long range dependence" ; (|2.2p is an asymptotic independence condition in the spirit of 
the classical condition D'\ see Leadbetter et al. [ID], Embrechts et al. [50]. The quantities to„ 
and r„ have some straightforward interpretation as size in large-small block scheme: the sample 
Xi, . . . ,Xn consists of roughly [n/nin] large disjoint blocks of size nin- After chopping off the first 



16 



R.A DAVIS, T. MIKOSCH, AND Y. ZHAO 



r„ elements in each large block one aims at ensuring the asymptotic independence of the resulting 
large blocks. 

If C is a /ii-continuity set and C x Mq'''' x C are ^h+i-continuity sets for every /i > 1, regular 
variation of X implies 

^ 777 

(2.3) var[P„,(C)]^-nC^), 

n 

where 

CXD 

F(C) =Aii(C) + 2 ^T^(C) and r^(C) = Mh+i(C x Iq^""'^ x C) , h>\, 

h=l 

and we also assume that the infinite series is finite. The asymptotic relation (|2.3p indicates that the 
condition rrin/n — > is needed to ensure the consistency of the estimator Pm{C). Under additional 
conditions, {Pm{C)) is asymptotically normal and this property also holds jointly for finitely many 
sets Ci, . . . ,Ch- The complicated form of the asymptotic variance in (|2.3p suggests that it is difficult 
to apply this central limit theorem for constructing asymptotic confidence bands. 
The motivating examples of extremograms are limits of conditional probabilities 

PAsih) = lim Pix-^Xh e B \ x'^Xo e A) , h>0, 



for sets A, B bounded away from zero. In Section 11.21 we mentioned the close relationship of pab 
with a cross-correlation function. Replacing the probabilities in these conditional probabilities by 
estimators of the type P^rt (C) and applying the corresponding central limit theory from |16| , Section 
3, and the continuous mapping theorem, one obtains an asymptotic theory for the ratio estimators 

PabW^ ^ — J , h>Q. 

The latter estimators only depend on the high threshold am which one typically chooses as a high 
empirical quantile of the data. These estimators can be interpreted as a sample cross-correlation 
function. 

Wc recall a central limit theorem for these estimators; see Corollary 3.4 in [16] and its correction 
Theorem 4.3 in [H]. 

Theorem 2.1. Let {Xt) he an -valued strictly stationary regularly varying sequence with index 
a > 0. Assume that the following conditions are satisfied. 

• The Borel sets A, i? C Rq are bounded away from zero and pi{A) > 0. 

• The sets A, B are continuous with respect to pi . 

• Condition (M), (n/m„)Q!r„ — > as n ^ oo. 

• m„ = o[n^/^) or 

(2.4) HblYai^O and asn-^oo. 

j=rn 

Then the following central limit theorem holds for h > 

(2.5) .[^\pAB{h)~PABMh)\ ^N{0,{p,{A)y^-E). 

for some matrix S0 where pAB,m{h) ~ P{a'^Xh G B \ a^^j^Xo G A). 
Some comments are here in place. 



^This matrix is complicated and irrelevant for our purposes; see |16| , (3.15) and (3.16) for its value. 
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The conditions of this result are rather technical but the mixing and anti-clustering condi- 
tions can be verified for standard time series models. For example, if (Xt) is a-mixing with 
geometric rate, then one can simply choose sequences r„ = [Clogn] for a large constant 
C > or r„ = n*^, and to„ = n^*^ for suitable small e > 0. 

The asymptotic variance is not of practical use. Therefore Davis et al. [17] suggest an alter- 
native way of constructing confidence bands for pAsih), by using the stationary bootstrap 
introduced by Politis and Romano [54] . 

Central limit theory and bootstrap consistency for the sample extremogram do not follow 
from standard results for sequences of mixing stationary sequences. This is due to the fact 
that we deal with sequences of indicator functions (^{a-iy-jgc}) ^'-'^ certain strictly stationary 
sequences (Yf) and sets C bounded away from zero. The sequences (/|^-iy^g|^-j) constitute 
a triangular array of row-wise strictly stationary sequences for which, to the best of our 
knowledge, standard asymptotic theory is not available. 

The convergence rate y^n/m in the central limit theorem p.Sp is due to the triangular array 
structure; it can be significantly slower than standard y/n-raies. 

We call pAB,m in ()2.5p the pre- asymptotic extremogram since, in general, one cannot replace 
the centering constants pAB,m{h) by their limits pAsih); see Example 12. 21 below. Moreover, 
it is in general very difficult to show that 

\PAB.Tn{h) - pABih)\ ^ , as n oo, 

even for "nice" models such as the GARCH(1,1). For this well studied model one lacks 
precise information about the tail behavior. The central limit theorem (j2.5p (and related 
bootstrap procedures) arc then used to approximate the conditional probabilities pAB,m{h). 
These have a very concrete interpretation in contrast to their less intuitive limits pAB{h). 
If (j2.6p holds with a rather slow convergence rate one faces a bias problem. This problem 
can be observed e.g. for a simulated stochastic volatility process (Xt) and A = B = (l,oo). 
Then p(h) = pAA{h) ~ for h > 1. If {Xt) is a- mixing with geometric rate it can be verified 
that (|2.6p and (|2.4p hold for to„ = n"^ , 7 £ (1/3, 1), and then (|2.5p applies with PAA.m{h) 
replaced by p{h) = 0; see [1^], Section 4.2. Also notice that pAA{h) is of the order 1/m in 
the iid case and hence greater than zero. 

The formulation of the results in (THl HZl (TB] related to Theorem 12.11 involve various other 
continuity conditions on sets. These conditions can be avoided as a close inspection of the 
proofs shows: these conditions follow from continuity of A and B with respect to pi. Indeed, 

one needs that sets of the form 0*1 Q are //fc- continuity sets, where Ci G {A, B, Mg, Ani?}, 

i = 1, . . . , fc, and at least one of the sets Gt does not coincide with Mq. Let S be the set of 

indices i such that C,; does not coincide with Mq. Then 

k 

5((g)Q) C U(lo)'-' X da X (lo)'^-^-' . 
1=1 ies 

The sets in the union have /x^.-measure zero. For the sake of argument assume that i = 1 E S. 
Then 

lim nP{a-\Xi,...,Xk) G dCi x (lo)'^'-^) - lim nP{a~^Xi G dCx) 

n— J-cxo n—foo 

= pi{dCi) = Q. 




In applications one needs to choose the threshold am in some reasonable way. The choice of 
a threshold is inherent to extreme value statistics to which no easy solution exists. In [16] we 
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advocated to choose fixed high/low empirical quantile of the absolute values of the 

data and to experiment with several quantile values. If the plot of the sample extremogram 
is robust for a range of such quantilcs one can choose a quantile from that region. The 
theory in |161 117[ [51] is based on deterministic values The heuristic method described 
above advocates the choice of a data dependent threshold. Recent work by Kulik and Soulier 
[38j yields a theory for a modified sample extremogram with data dependent threshold in 
the case of short and long memory stochastic volatility processes. 

Example 2.2. In [12] we did not provide a concrete example of a sequence (Xt) for which (|2.6p 
does not hold. Such counterexamples can easily be constructed from max-stable strictly stationary 
processes with Frechet marginals; see Section [1.2.41 We assume the conditions of Proposition [T31 
For simplicity choose a = 1, A = B = (l,oo) and a„j = m foi^) i^{dx). The function p is given 
in (|1.22p . By Taylor expansion, 

= p(/i)-m-ic,, (1 + 0(1)), 

for some constant 7^ 0. Hence 

{n/mX^\pAAMh) - P{h)\ ^ {n/m^f-^ , 

and the right-hand side converges to zero if and only if n^/"^ = o(m) and the rate of convergence to 
zero can be arbitrarily small. The latter condition is, of course, in contradiction with m = o{n^^^) 
which is one possible sufficient condition for (|2.5|) . Fortunately, the other sufficient condition (|2.4p 
can still be satisfied if m = o{n^^'^) does not hold. For example, if ah decays to zero at a geometric 
rate and one chooses r„ = [Clogn] for some C > and m„ = n'^ for some 7 6(1/3,1). Particular 
cases with geometric decay of (ah) were mentioned in Examples 11.61 and 11.71 

2.2. Cross-extremogram for bivariate time series. While the definition of the extremogram 
covers the case of multivariate time series, it is of limited value if the index of regular variation is 
not the same across the component series. For example, consider two regularly varying univariate 
strictly stationary time series (Xt) and (Yt) with tail indices ax < ay- Then, assuming {{Xt, Yt))t£'L 
stationary, this bivariate time series would be regularly varying with index ax, and for Borel sets 
A, B bounded away from zero, A = ^ x K and i? = M x S, 

p^g{h) = lim P{Yh e xB I Xo e xA) = lim P{{Xh, Yh) € xB \ (Xq, Ya)exA) = Q, heZ. 



The asymptotic theory of Section \2A\ is applicable to the sets A ^ A x R and B = R x B. In this 
case, no extremal dependence between the two series would be measured. To avoid these rather 
uninteresting cases and obtain a more meaningful measure of extremal dependence, we transform 
the two series so that they have the same marginals. In extreme value theory, the transformation 
to the unit Frechet distribution is standard. For the sake of argument, assume that both Xt and 
Yt are positive so that the focus of attention will be on extremal dependence in the upper tails. 
The case of extremal dependence in the lower tails or upper and lower tails is similar. Under the 
positivity constraint, if Fi and F2 denote the distribution functions of Xt and Yt, respectively, 
and are continuous, then the two transformed series, Xt = Gi{Xt) and Yt = G2{Yt) with Gi{z) = 
— 1/ log(Fi(z)), i = 1,2, have unit Frechet marginals $1; see (|1.15p . Now assuming that the bivariate 
time series {{Xt,Yt))t£i, is regularly varying, we define the cross-extremogram by 

PabW = lim PiYh exB\XoexA), heZ, 
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At first glance, this may seem inconvenient since transformations to unit Frechet marginals are 
required. If one restricts attention to sets A and B that are intervals bounded away from or finite 
unions of such sets the transformation simplifies: if a„ denotes the (1 — rt ~-'^)- quant ile of <J>i, then 
by monotonicity of Gi, {Xh S a„ A} = {X^ £ ax,nA} and {Y^ G a„ B} = {Y?, G ay^„-B}, where 
ax.n and aY,n are the respective (1 — n~^)-quantiles of the distributions of Xt and Ij. For sets A 
and B of the required form, the cross-cxtremogram becomes 

(2.7) p^gih) = lim P{Y,, e ay.nB \ Xo e ax,n A) . 

Thus we do not actually need to find the transformations converting the data to unit Frechet, 
only the component-wise quantiles, ax,n and ay^m need to be calculated. Clearly, this notion of 
extremogram extends to more than two time series. 

3. An EXAMPLE: Equity indices 

We consider daily log-returns equity indices of four countries: S&P 500 for US, FTSE 100 for 
UK, DAX for Germany, Nikkei 225 for Japan. Figure [3T1 shows the sample extremogram p = paa 
for the negative tails {A = B = {—oo, —1) with a„i estimated as the 96% empirical quantiles of the 
absolute values of the negative data) applied to 6,443 log-returns of the FTSE and S&P (April 4, 
1984 to October 2, 2009), to 4,848 log-returns of the DAX (November 13, 1990 to October 2, 2009) 
and to 6,333 log-returns of the Nikkei (August 23, 1984 to October 2, 2009)Q The solid horizontal 
lines in the plots represent 98% confidence bands. They correspond to the maximum and minimum 
of the sample extremogram at lag 1 based on 99 random permutations of the data. If the data 
were independent random permutations would not change the dependence structure: values pAsih) 
which arc outside the confidence bands indicate that there is significant extremal dependence at lag 
h. The sample extremograms for all four indices decay rather slowly to zero, with S&P the slowest. 
Among the four indices, the Nikkei displays the least amount of extremal dependence as measured 
by the extremogram. The top graphs in Figure [3Tl indicate extremal dependence in the lower tail 
over a period of 40 days. 

We assume that the log-return series are modeled by a GARCII(1, 1) process (see Section [1.2.41 
for its definition). Then we can estimate its parameters, calculate the volatility sequence (St) and 
the filtered sequence Zt — Xt/at, t ~ 1, . . . ,n. Figure \3l2\ shows the sample extremograms p for the 
filtered FTSE and S&P sequences. These plots confirm that much of the extremal dependence (as 
measured by the extremogram) has been removed. Hence the extremal dependence in the log-returns 
is due to the volatility sequences (crt), as suggested by the GARCH model. 

For a bivariate time series {{Xt, Yt))ti£z the sample cross-extremogram is given by 

En — h J 
_ t=l ^{Yt+heaY,mB,Xteax.mA} 
PaB\'^I ~ Y^n T ' 

where ax,m and ay^m are the (1 — m~^)-quantiles of the marginal distributions of X and Y , respec- 
tively. For applications, they need to be replaced by the corresponding empirical quantiles. 

We calculate the sample cross-extremograms for the pairs of the log-returns series, again for the 
negative tails, i.e. A = B = (— oo, —1), and ax.m and aY.m are chosen as the 96% empirical quantiles 
of the negative values of the corresponding component samples. Since the samples have different 
sizes we consider those periods for which we have observations on both indices. 



^As noted in the literature, the lower tails of returns tend to be heavier than the upper tails. Similar plots (not 
shown here) of the sample extremogram for the upper tails also reveal extremal dependence, but to a lesser extent 
than seen in the lower tails. 
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Figure 3.1. The sample extremogram for the lower tails of the FTSE (top left), S&P (top right), 
DAX (bottom left) and Nikkei. The solid lines represent 98% confidence bands based on 99 random 
permutations of the data. 



The sample cross-extremogram of any pair of series exhibits a similar pattern of slow decay as 
seen in the univariate sample extremograms (we do not include these figures). Figure [331 shows 
the sample cross-extremograms for the filtered series. For example, in the first row of graphs, [Xt) 
is the filtered FTSE and (Yj) are the filtered S&P, DAX and Nikkei, respectively. There are signs 
of various types of cross-extremal dependence in the filtered series. The spikes at lag zero (except 
between the Nikkei and S&P) indicate the strong extremal dependence of the multiplicative shocks. 
In the second row, there is evidence of significant extremal dependence at lag one for each sample 
cross-extremogram: given the S&P has an extreme left tail event in a shock at time t there will be 
a corresponding large left tail shock in the FTSE, the DAX and the Nikkei at time t = 1. Given 
the dominance of the US stock market, one might expect a carry-over effect of the shocks on the 
other exchanges on the next day. Since only marginal GARCH models were fitted to the data, it 
may not seem all that surprising that the filtered series exhibit serial dependence. We should note, 
however, that the dependence in the shocks does not appear to last beyond one time lag. 

4. A Fourier analysis of extreme events 

Classical time series analysis studies the second order properties of stationary processes in the 
time and frequency domains. The latter approach refers to spectral (or Fourier) analysis of the 
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Figure 3.2. The sample extremograms for the filtered FTSE (left) and filtered S&P (right) series. The 
bold lines represent 98% confidence hounds based on 99 random permutations of the series. 



time series. We mentioned in Section [TT2] that the extremograms 7^ = ^aa and pA = Paa for a set 
A bounded away from zero are covariancc and correlations functions of some stationary sequence. 
Then it is possible to study the corresponding spectral properties of the extremogram. Research in 
this direction was started in [IB] and continued in [5T]. We recall some of the results. Throughout we 
assume that (^t) is a M''- valued strictly stationary regularly varying sequence with index a > and 



A is a /ii-continuity set. In the comments following Theorem 12. II we mentioned the latter property 
also implies that the sets A x (Mg)'* x A are ^/j-continuity sets for ft, > 0, so the limits pA{h) exist. 

Assuming that pA is square summable, we consider the corresponding spectral density corre- 
sponding to PA (see Brockwell and Davis [5], Chapter 4): 

00 

/^(A) = l + 2^p^(/i)e-"'\ Ae[0,7r]. 

h=l 

A natural estimator of the spectral density is obtained if we replace the quantities pA{h) by the 
sample versions 

^ ... f Er=i''(4-^XteA-Po)(/a-ix,+,eA-Po) 

PA{h) = , h>l. 



where po = P{a7nX e A)E 



/„^(A) - 1 + 2 V Mh) e -'"^ = ^ \^ ^ 

'"■ v^n J- P ( A ) 

h=l — 2^t=l^{a-^Xt<£A} ^r,i(^) 



We will refer to InA and its standardized version InA as periodogram of the extreme event am A. 
Indeed, if we replaced the normalization m/n hy 1/n, I^a is the periodogram of the sequence of 
centered indicators {I^a'^XteA} ~ Po)- These sequences constitute a triangular array of row-wise 
strictly stationary sequences for which standard asymptotic theory for the periodogram does not 
apply; for an asymptotic theory of the periodogram of a strictly stationary linear process, see 



®The centering of the indicator functions with their expectation po is crucial for deriving asymptotic the- 
ory. In applications, /„yi(A) is typically evaluated at the Fourier frequencies i^j(ri) = 2nj/n G (0, vr) and since 
J2h=i e = 0, centering in J„A(iJj(n)) is not needed. 
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Figure 3.3. The sample cross-extremograms for the filtered FTSE, S&P, DAX and Nikkei series. For the 
first row, (Xt) is the filtered FTSE and (Yt) are the filtered S&P, DAX and Nikkei (from left to right). For 
the second, third and fourth rows, the Xt 's are the filtered S&P, DAX and Nikkei series, respectively. 



Brockwell and Davis [5], Chapter 10. However, the periodogram of extreme events shares some of 
the basic properties of the periodogram, as the following results from Mikosch and Zhao [51] show. 

Theorem 4.1. Let (Xt) be an M."^ -valued strictly stationary regularly varying sequence with index 
a > satisfying condition (M), A C Mq be a fii- continuity set and X];i>i PA{h) < co. 
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• Assume A G (0,7r) is fixed and Un ~ 27rj„/n, j„ G Z. is any sequence of Fourier frequencies 
such that ujn — > A. Then 

lim EInAW = lim ElnAi^Jn) Mi(^) /a (A) , 

• Assume in addition that the sequences (rUn), (rn) from (M) also satisfy the growth conditions 
{n/m)ar^ — > 0, and ran = o{n^^^). Let (Ei) be a sequence of iid standard exponential 
random variables. Consider any fixed frequencies < Xi < ■ ■ ■ < Xn < for some N > I. 
Then the following relations hold: 

(/nA(A,)),^l . jV ^ l^l{A){fA{X^)E,)^^^ 

Consider any distinct Fourier frequencies uJi(n) — > A; G (0, tt) as n oo, i = 1, . . . , iV. T/ie 
limits Xi do not have to be distinct. Then the following relations hold: 

{InA{uJ^{n)))^^^ J^ 4 Hi{A){fA{X,)E,)^^^^^^^ j^, 71 ^ OO , 

(I„A{^^^{n)))^^^^ 4 {fA{X^)E,)^^^^ n->oo. 

These properties are very similar to those of a strictly stationary weakly dependent sequence. 
The asymptotic independence of the periodogram of the extreme event UmA and consistency in the 
mean of /„a(A) give raise to the hope that pointwise consistent smoothed periodogram estimation 
of the spectral density /a (A) is possible. 

For a fixed frequency A G (0, tt) define 

Ao = min{27rj/r7, : 27rj/n > A} , and Xj = Aq + 2-Kj/n , |i| < s ■ 

(We suppress the dependence of Xj on n.) Assume that s = s,i — )■ oo and s,i/n — > as n — > oo. 
Consider the non-negative weight function {wn{j))\j\<s satisfying the conditions 

E|j|<,s ^n(j) = 1 and T.\j\<s ^lU) ^- as n -> oo . 
Introduce the corresponding smoothed periodogram 

fnAiX) = ^ UJn{j) I,ia{Xj), 

|j|<Sn 

Under the conditions of the second item in Theorem 14.11 and some further restrictions on the growth 
of (m„) and (ah) the following limit relations hold for a fixed frequency A G (0,7r), 

fnA{X) fii{A) fAiX) and /„^(A)/P™(A) 4 /^(A) . 

In Figure|3]we illustrate how the smoothed periodogram /„, (-00,-1) works for 5-minute log-returns 
of Bank of America stock prices. We choose a simple Daniell window with Wn{j) = l/(2s„ + 1) and 
Sn — 52. 
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